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Abstract 


In many applications involving epistemic uncertainties usually modeled by belief functions, it 
is often necessary to approximate general (non-Bayesian) basic belief assignments (BBAs) 
to subjective probabilities (called Bayesian BBAs). This necessity occurs if one needs to 
embed the fusion result in a system based on the probabilistic framework and Bayesian 
inference (e.g. tracking systems), or if one needs to make a decision in the decision making 
problems. In this paper, we present a new fast combination method, called modified rigid 
coarsening (MRC), to obtain the final Bayesian BBAs based on hierarchical decomposition 
(coarsening) of the frame of discernment. Regarding this method, focal elements with prob- 
abilities are coarsened efficiently to reduce computational complexity in the process of com- 
bination by using disagreement vector and a simple dichotomous approach. In order to 
prove the practicality of our approach, this new approach is applied to combine users’ soft 
preferences in recommender systems (RSs). Additionally, in order to make a comprehen- 
sive performance comparison, the proportional conflict redistribution rule #6 (PCR6) is 
regarded as a baseline in a range of experiments. According to the results of experiments, 
MRC is more effective in accuracy of recommendations compared to original Rigid Coarsen- 
ing (RC) method and comparable in computational time. 


Introduction 


The theory of belief functions, known as Dempster-Shafer Theory (DST) was developed by 
Shafer [1] in 1976 from Dempster’s works [2]. Belief functions allow one to model epistemic 
uncertainty [3] and they have been already used in many applications since the 1990’s [4], 
mainly those relevant to expert systems, decision-making support and information fusion. To 
palliate some limitations (such as high computational compelxity) of DST, Dezert and Smar- 
andache proposed an extended mathematical framework of belief functions with new efficient 
quantitative and qualitative rules of combinations, which was called DSmT (Dezert and Smar- 
andache Theory) in literature [5, 6] with applications listed in [7]. One of the major drawbacks 
of DST and DSmT is their high computational complexities, on condition that the fusion 
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space (i.e. frame of discernment—FoD) and the number of sources to combine are large. 
DSmT is more complex than DST, and the Proportional Conflict Redistribution rule #6 
(PCR6 rule) becomes computationally intractable in the worst case as soon as the cardinality 
of the Frame of Discernment (FoD) is greater than six. 

To reduce the computational cost of operations with belief functions when the number of 
focal elements is very large, several approaches have been proposed by different authors. Basi- 
cally, the existing approaches rely either on efficient implementations of computations as pro- 
posed for instance in [8, 9], or on approximation techniques of original Basic Belief 
Assignment (BBA) to combine [10-14], or both. From a fusion standpoint, two approaches 
are usually adopted: 1) one can approximate at first each BBA in subjective probabilities and 
use Bayes fusion rule to get the final Bayesian BBA [11, 12], or 2) one can fuse all the BBAs 
with a fusion rule, typically Dempster-Shafer’s, or proportional conflict redistribution rule #6 
(PCR6) rules (which is very costly in computations), and convert the combined BBA in a sub- 
jective probability measure [10, 14]. The former method is the simplest method but it gener- 
ates a high loss of information included in the original BBAs, whereas the latter method is 
intractable for high dimension issues. 

This paper presents a new combination method, called modified rigid coarsening (MRC), 
to get the final Bayesian BBAs based on hierarchical decomposition (coarsening) of the frame 
of discernment, which can be seen as an intermediary approach between the two aforemen- 
tioned methods. This hierarchical structure allows to encompass bintree decomposition and 
mass of coarsening FoD on it. To prove the practicality of our proposed method, MRC is 
applied to combine users’ preferences so as to provide the suitable recommendation for RSs. 
Preliminary work on original rigid coarsening (RC) has been published in our recent work 
[15] (This is an extended version of the paper presented at the 20th IEEE International Confer- 
ence on Information Fusion, XIAN, China). In this paper, more detailed analyses of this new 
combination method are provided. More importantly, this innovative method is also applied 
into the real application. These are all added values (contributions) of this paper. 

The main contributions of this paper are: 


1. the presentation of the FoD bintree decomposition on which will be done the BBAs 
approximations; 


2. user preferences in Recommender Systems (RSs) are modeled by DSmT-Modeling 
Function. 


In order to measure the efficiency and effectiveness of the MRC, it is integrated in the RSs 
based on DSmT and compared to traditional methods in the experiments. The results show 
that regarding the accuracy of recommendations, MRC is extremely close to classical PCR6; 
and the computational time of MRC can be obviously superior to that of PCR6. 

The remainder of this paper is organized as follows. In section 2, we review relevant prior 
work on DST and DSmT first. In section 3, MRC is presented. In section 4, a recommendation 
system based on DSmT, that employs MRC to combine users’ preferences, is shown. In section 
5, we evaluate our proposed algorithm based on two public datasets: Movielens and Flixster. 
Finally, we conclude and discuss future work. 


Mathematical background 


This section provides a brief reminder of the basics of DST and DSmT, which is necessary for 
the presentation and understanding of the more general MRC of Section 3. 

In DST framework, the frame of discernment (Here, we use the symbol £ to mean equals 
by definition.) © £ {0,,...,6,} (m > 2) is a set of exhaustive and exclusive elements 
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(hypotheses) which represents the possible solutions of the problem under consideration and 
thus Shafer’s model assumes 0;N 6; = ) for i Æ j in {1, .. ., n}. A basic belief assignment (BBA) 
m(-) is defined by the mapping: 2° > [0, 1], verifying m() = 0 and Z4e20 m(A) = 1. In DSmT, 
one can abandon Shafer’s model (if Shafer’s model doesn’t fit with the problem) and refute the 
principle of the third excluded middle. The third excluded middle principle assumes the exis- 
tence of the complement for any elements/propositions belonging to the power set 2°. Instead 
of defining the BBAs on the power set 2° £ (@, U) of the FoD, the BBAs are defined on the 
so-called hyper-power set (or Dedekind’s lattice) denoted D? £ (O, U , N ) whose cardinalities 
follows Dedekind’s numbers sequence, see [6], Vol.1 for details and examples. A (generalized) 
BBA, called a mass function, m(-) is defined by the mapping: D? = [0, 1], verifying m() =0 
and Yacpe m(A) = 1. The DSmT framework encompasses DST framework because Iepe, 
In DSmT, we can take into account also a set of integrity constraints on the FoD (if known), by 
specifying all the pairs of elements which are really disjoint. Stated otherwise, Shafer’s model is 
a specific DSm model where all elements are deemed to be disjoint. A € DÌ is called a focal ele- 
ment of m(.) if m(A) > 0. A BBA is called a Bayesian BBA if all of its focal elements are single- 
tons and Shafer’s model is assumed, otherwise it is called non-Bayesian [1]. A full ignorance 
source is represented by the vacuous BBA m,(@) = 1. The belief (or credibility) and plausibility 
functions are respectively defined by Bel(X) £ J yepejycx m(Y) and 
PI(X) £ J yepojynxzo M(Y). BI(X) £ [Bel(X), PI(X)] is called the belief interval of X. Its length 
U(X) £ PI(X) — Bel(X) measures the degree of uncertainty of X. 

In 1976, Shafer did propose Dempster’s rule and we use DS index to refer to Dempster-Sha- 
fer’s rule (DS rule) because Shafer did really promote Dempster’s rule in in his milestone book 
[1]) to combine BBAs in DST framework. DS rule is defined by mps(0) = 0 and VA € 2° {Ø}, 


P 2s cez gnc-a m (B)m,(C) 
ost) = I= > s,ce2jsnc-0 (B)m,(C) () 





The DS rule formula is commutative and associative and can be easily extended to the fusion 
of S > 2 BBAs. Unfortunately, DS rule has been highly disputed during the last decades by 
many authors because of its counter-intuitive behavior in high or even low conflict situations, 
and that is why many rules of combination were proposed in literature to combine BBAs [16]. 
To palliate DS rule drawbacks, the very interesting PCR6 was proposed in DSmT and it is usu- 
ally adopted (PCR6 rule coincides with PCR5 when combining only two BBAs [6]) in recent 
applications of DSmT. The fusion of two BBAs m,(.) and m,(.) by the PCR6 rule is obtained by 
Mpcro() = 0 and VA € D° \{9} 


Mocrs(A) = ma(A) 
m,(A)'m,(B) | m,(A)°m,(B) (2) 


. m,(A) + m,(B) ` m,(A) + m,(B) 





BeD®\{A}|A | 


where mi2(A) = Xg,cepejsn c-a ™1(B)m2(C) is the conjunctive operator, and each element A 
and B are expressed in their disjunctive normal form. If the denominator involved in the frac- 
tion is zero, then this fraction is discarded. The general PCR6 formula for combining more 
than two BBAs altogether is given in [6], Vol. 3. We adopt the generic notation m{$*°(.) = 
PCR6(m,(.), m,(.)) to denote the fusion of m,(.) and m2(.) by PCR6 rule. PCR6 is not associa- 
tive and PCR6 rule can also be applied in DST framework (with Shafer’s model of FoD) by 


replacing D? by 2° in Eq (2). 





PLOS ONE | https://doi.org/10.1371/journal.pone.0189703 January 19, 2018 3/25 


-PLOS | ONE 


Modified rigid coarsening (MRC) method 





Modified rigid coarsening for fusion of Bayesian BBAs 


Here, we introduce the principle of MRC of FoD to reduce the computational complexity of 
PCR6 combination of original Bayesian BBAs. Considering the case of non-Bayesian BBAs, it 
requires decoupling all non-singletons in these BBAs in advance, The fusion of original non- 
Bayesian BBAs needs to be decoupled by using DSmP in advance, which will explain in 
Section 4. 


Rigid coarsening 


This proposal was initially called rigid coarsening (RC) in our previous works [17-19] and cur- 
rently improved in our recent work [15]. The goal of this coarsening is to replace the original 
(refined) FoD © by a set of coarsened ones to make computation of the PCR6 rule tractable. 
Because we consider here only Bayesian BBA to combine, their focal elements are only single- 
tons of the FoD © £ {0,,...,0@,}, with n > 2, and we assume Shafer’s model of the FoD ©. A 
coarsening of the FoD © means to replace it with another FoD less specific of smaller dimen- 
sion Q = {@j, ..., wx} with k < n from the elements of ©. This can be done in many ways 
depending the problem under consideration. Generally, the elements of Q are singletons of ©, 
and disjunctions of elements of ©. For example, if © = {0), 02, 03, 04}, then a possible coarsened 
frame built from © could be, for instance, Q = {w, = 0), w = 05, w3 = 03 U 04}, or Q = {w; = 

0, U b2, w = 03 U 04}, ete. 

Definition 1: When dealing with Bayesian BBAs, the projection (For clarity and convenience, 
we put explicitly as upper index the FoD for which the belief mass refers) m(.) of the original 
BBA m®.) is simply obtained by taking 

m? (%,) = 5 m° (0,) (3) 


0,50; 





The rigid coarsening process is a simple dichotomous approach of coarsening obtained as 
follows: 


e Ifn = |O] is an even number: 
The disjunction of the n/2 first elements 6, to Oy of © define the element w; of Q, and the last 


n/2 elements 0,,, to 0„ of © define the element w of Q, that is 


n 
ztl 


Q£{0,=0, U... U On, 03 = On U... U Op} 


and based on Eq (3), one has 


m? (wy) = 5 m° (0,) (5) 
j=$tl pag n 
For example, if © = {6,, 02, 03, 04}, and one considers the Bayesian BBA m®°(6,) = 0.1, 
m®°(0,) = 0.2, m? (03) = 0.3 and m°(6,4) = 0.4, then Q = {w, = 0, U bz, wz = 03 U 04} and 
m°(w,) = 0.1 + 0.2 = 0.3 and m®(w,) = 0.3 + 0.4 = 0.7. 


If n = |O] is an odd number: 
In this case, the element w; of the coarsened frame Q is the disjunction of the [n/2 + 1] (The 
notation [x] means the integer part of x) first elements of ©, and the element w, is the 
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Table 1. Three Bayesian BBAs for Example 1. 




















Focal elem. m9.) m§(.) me (.) 
0, 0.1 0.4 0 
b 0.2 0 0.1 
03 0.3 0.1 0.5 
0, 0.3 0.1 0.4 
05 0.1 | 0.4 0 














https://doi.org/10.1371/journal.pone.0189703.t001 


disjunction of other elements of ©. That is 


O4{@,=0,U...U Orga] Or = Oy 


and based on Eq (3), one has 


(7) 


For example, if © = {6,, 02, 03, 04, 05}, and one considers the Bayesian BBA m°(6,) = 0.1, 
m® (6) = 0.2, m? (03) = 0.3, m? (04) = 0.3 and m°(65) = 0.1, then Q = {w, = 0, U 02 U 43, w = 
6,U 05} and m°(w,) = 0.1 + 0.2 + 0.3 = 0.6 and m°(w,) = 0.3 + 0.1 = 0.4. 


Of course, the same coarsening strategy applies to all original BBAs m°(.), s = 1, .. .S of the 
S > 1 sources of evidence to work with less specific BBAs m®(.), s = 1, . . .S. The less specific 
BBAs (called coarsened BBAs by abuse of language) can then be combined with the PCR6 rule 
of combination according to formula Eq (2). This dichotomous coarsening method is repeated 
iteratively / times as schematically represented by a bintree. Here, we consider bintree only for 
simplicity, which means that the coarsened frame Q consists of two elements only. Of course a 
similar method can be used with tri-tree, quad-tree, etc. The last step of this hierarchical pro- 
cess is to calculate the combined (Bayesian) BBA of all focal elements according to the connec- 
tion weights of the bintree structure, where the number of layers l of the tree depends on the 
cardinality |O] of the original FoD ©. Specifically, the mass of each focal element is updated 
depending on the connection weights of link paths from root to terminal nodes. This principle 
is illustrated in details in the following example. 

Example 1: Let’s consider © = {6,, 02, 63, 04, 65}, and the following three Bayesian BBAs can 
be seen in Table 1: 

The rigid coarsening and fusion of BBAs is deduced from the following steps: 

Step 1: We define the bintree structure based on iterative half split of FoD as shown in Fig 1. 

The connecting weights are denoted as A, . . ., Ag. The elements of the frames Q; are defined 
as follows: 


e Atlayer/= 1:0, = {w, £0, U 0, U 03,0, 20, U 0;} 
e At layer l = 2: 


Q, = {0 £0, U 03, O1 £ 03, Oy, £ 01, On = 95} 


e At layer l= 3: Q, = {0 £ 01, @ £ 03} 





PLOS ONE | https://doi.org/10.1371/journal.pone.0189703 January 19, 2018 5/25 


©). PLOS | ONE Modified rigid coarsening (MRC) method 








Fig 1. Fusion of Bayesian BBAs using bintree coarsening for Example 1. 





https://doi.org/10.1371/journal.pone.0189703.g001 


Step 2: The BBAs of elements of the (sub-) frames Q; are obtained as follows: 


At layer l = 1, we use Eqs (6) and (7) because |O] = 5 is an odd number. Therefore, we get the 
BBAs in Table 2: 


e At layer l = 2: We work with the two subframes Q,, £ {@,,, @,.} and Q,, £ {@5,, Moo} of Q3 
with the BBAs in Tables 3 and 4: 
These mass values are obtained by the proportional redistribution of the mass of each focal 
element with respect to the mass of its parent focal element in the bin tree. For example, 
m5” (c,,) = 4/5 is derived by taking 

= ms (0,) + m? (0,) 04 4 

- m9(0,) + m9(0,) + m9(83) 0.5 5 








m3” (%4) 


Other masses of coarsening focal elements are computed similarly using this proportional 
redistribution method. 


At layer / = 3: We use again the proportional redistribution method which gives us the BBAs 
of the sub-frames Q; in Table 5: 


Step 3: The connection weights 4; are computed from the assignments of coarsening ele- 
ments. In each layer l, we fuse sequentially the three BBAs using PCR6 formula Eq (2). Because 
PCR6 fusion is not associative, we should apply the general PCR6 formula to get best results. 
Here we use sequential fusion to reduce the computational complexity even if the fusion result 
is approximate. More precisely, we compute at first mis" (.) = PCR6(m;"(.), ms"(.)) and 
then ma (.) = PCR6(m's"*™(.), m5’ (.)). Hence, we obtain the following connecting 


weights in the bintree: 


Table 2. The BBAs of elements of the sub-frames ©, for Example 1. 




















Focal elem. my (.) m? (.) m9 (.) 
o, £0, U 0, U 0, 0.6 0.5 0.6 
wo, £0, U 0, 0.4 0.5 0.4 





https://doi.org/10.1371/journal.pone.0189703.t002 
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Table 3. The BBAs of elements of the sub-frames Q3; for Example 1. 











Focal elem. mẹ” () ms? (.) mo! () 
oO, £0, U 4, E E A 
O £0; i i A 














https://doi.org/10.1371/journal.pone.0189703.t003 


Table 4. The BBAs of elements of the sub-frames ©, for Example 1. 











Focal elem. mẹ? (.) ms” (.) me (.) 
Oy, £0, 7 5 1 
On £0; i ; 0 














https://doi.org/10.1371/journal.pone.0189703.t004 


Table 5. The BBAs of elements of the sub-frames ©, for Example 1. 








Focal elem. m? (.) me (.) me (.) 
On = 9, 3 1 0 
O * 9, 2 0 1 














https://doi.org/10.1371/journal.pone.0189703.t005 


e At layer /= 1: 


A, = mags" (@,) = 0.6297 
> = min 10, = 0.3703 
e At layer l = 2: 
he: = Hite On = 0.4137 
he = mags ~ (0p) = 0.5863 
hye = mos ty) = 0.8121 
Ag = Mans? (Wy) = 0.1879 
e At layer l= 3: 
hy = Hey (Gn) = 0.3103 
hig. = maas (Os) = 0.6897 


Step 4: The final assignments of elements in original FoD © are calculated using the prod- 
uct of the connection weights of link paths from root (top) node to terminal nodes (leaves). 
We eventually get the combined and normalized Bayesian BBA: 


m°(0,) = Ay + Ag+ Ay = 0.6297 - 0.4137 - 0.3103 = 0.0808 


m®(0,) Ay “Ay «My = 0.6297 - 0.4137 - 0.6897 = 0.1797 
m°(0,) = A, +’, = 0.6297 - 0.5863 = 0.3692 
m®(0,) Ay “Ag = 0.3703 - 0.8121 = 0.3007 
m°(0,) = y+ Ag = 0.3703 - 0.1879 = 0.0696 
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Modified rigid coarsening 


One of the issues with RC described in the previous section is that no extra self-information of 
focal elements is embedded into the coarsening process. In this paper, the elements 6; selected to 
belong to the same group are determined using the consensus information drawn from the 
BBAs provided by the sources. Specifically, the degrees of disagreement between the provided 
sources on decisions (04, 02, ---, 0n) are first calculated using the belief-interval based distance 
dgr [20] to obtain disagreement vector. And then all focal elements in FoD are sorted in an 
ascending order. Finally, the simple dichotomous approach is utilized to hierarchical coarsen 
those Re-sorted focal elements. 

Calculating the disagreement vector. Let us consider several BBAs m®(-), (s = 1, ..., S) 
defined on same FoD © of cardinality |O| = n. The specific BBAs mo(.),i=1,...,n entirely 
focused on 6; are defined by m(0;) = 1, and for X 4 0; mg(X) = 0. 

Definition 2: The disagreement of opinions of two sources about 6; is defined as the L,-dis- 
tance between the dp; distances of the BBAs m$ (.), s = 1, 2 to me(.), which is expressed by 


D,2(;) £ |dgy(m?(-), mo, ()) — dar(my (-), 19,(+))| (8) 


Definition 3: The disagreement of opinions of S > 3 sources about 0; is defined as 


D,_s(6;) = SDD m ()) — dys (m+), mp, (-))| (9) 


i=1 j=1 


where dp; distance is defined by [20] and proof of Definition 3 is given in S1 Appendix. For sim- 
plicity, we assume Shafer’s model so that |29| = 2”, otherwise the number of elements in the sum- 
mation of Eq (10) should be |D°| — 1 with another normalization constant ne. 





dg, omm) & a]. ld (81 (0), BLOD (10) 


Here, n, = 1/2” is the normalization constant and d'({a, b], [c, d]) is the Wasserstein’s distance 
defined by d! (|a, b], {c,d]) = VE e+e ? + 3 [be — de) *_ And BI(0)) = [Bel(0;), PIO). 
The disagreement vector D,_, is defined by 


D,_5=[D,_s(4:),---,Di-s(9,)] (11) 








Modified rigid coarsening by using the disagreement vector. Once D,_s is derived, all 
focal elements {6;, 02, -- -, On} are sorted according to their corresponding values in D,_s. 

Let us revisit example 1 presented in the previous section. It can be verified in applying for- 
mula Eq (9) that the disagreement vector D,_; for this example is equal to 


D,_, = [0.4085, 0.2156, 0.3753, 0.2507, 0.4086] 


The derivation of D,_3(0;) is given below for convenience. 


D,_3(0,) = |da (m? C), mo, (01)) — dg (m>(-), mo, (9,))| 
+|dr (m9 (+), mo, (0,)) (m3(-), mo, (9,))| 
0 


+|dp)(mpP(-), mo, (01)) m3 (+), ma, (01))| 
= 0.4085. 


dgr 
dy( 





Based on the disagreement vector, a new bintree structure is obtained and shown in Fig 2. 
Compared with Fig 1, the elements in FoD © are grouped more reasonably. In vector D1-3, 0; 
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0 0,0, 0, 6, 


Sorting 


0, 0, 0, 8 0, 












Fig 2. Fusion of Bayesian BBAs using MRC for Example 1. 
https://doi.org/10.1371/journal.pone.0189703.g002 





and @; lie in similar degree of disagreement so that they are put in the same group. Similarly 
for 0, and 04. However, element 03 seems weird, which is put alone in the process of coarsen- 
ing. Once this new bintree decomposition is obtained, other steps can be implemented which 
are identical to rigid coarsening in section to get the final combined BBA. 

Step 1: According to Fig 2, the elements of the frames Q; are defined as follows: 


e Atlayer/= 1:0, = {w, £0, U 0, U 03,0, 20, U 0;} 
e At layer] = 2:Q, = {@,, £ 0, U 04, @ £ 03, Wy, £ 01, oy = 05} 
At layer] = 3: Q, = {,,, £ 03, @ £ 0a} 
Step 2: The BBAs of elements of the (sub-) frames Q; are obtained as follows: 
e At layer l = 1, we use Eqs (6) and (7) and we get (Table 6) 


e At layer l = 2: We use again the proportional redistribution method which gives us Tables 7 
and 8. Here, masses of w21, w22 in mg? (.) are not considered because the mass of their parent 


focal element (m5 (œ,)) in bintree is 0. 


e At layer l = 3: We work with the two subframes Q, £ {@,,,, @12} of Q; with the BBAs in 
Table 9: 


Table 6. The BBAs of elements of the sub-frames Q, Using MRC for Example 1. 




















Focal elem. m” (.) m’ () m (6) 
w, £0, U 0, U 0, 0.8 0.2 1.0 
o, £0, U 0, 0.2 0.8 0.0 





https://doi.org/10.1371/journal.pone.0189703.t006 
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Table 7. The BBAs of elements of the sub-frames Q2; Using MRC for Example 1. 











Focal elem. mẹ” () ms?! (.) mo! () 
on £0, U 0, : A 2 
O15 2 0; 3 z 3 














https://doi.org/10.1371/journal.pone.0189703.t007 


Table 8. The BBAs of elements of the sub-frames Q22 Using MRC for Example 1. 





Focal elem. m?” () ms? () me (.) 





On, = 0, 





nie foie 
nope foie 


Doo 4 0; 


https://doi.org/10.1371/journal.pone.0189703.t008 














Table 9. The BBAs of elements of the sub-frames Q, Using MRC for Example 1. 











Focal elem. m” () me (.) m? () 
On £0, 2 0.0 p 
Ory) 0, 3 1.0 : 














https://doi.org/10.1371/journal.pone.0189703.t009 


Step 3: The connection weights 4; are computed from the assignments of coarsening ele- 
ments. Hence, we obtain the following connecting weights in the bintree: 


e At layer /= 1: 
i, = 0.8333; A, = 0.1667. 


At layer / = 2: 
A, = 0.5697; A, = 0.4303; 
à; = 0.5000; A, = 0.5000. 


e At layer l= 3: 
A, = 0.0669; A, = 0.9331; 


Step 4: We finally get the following combined and normalized Bayesian BBA 
m®(-) = {0.0833, 0.0318, 0.3586, 0.4430, 0.0834}. 


Summary of the proposed method 


The fusion method of BBAs to get a combined Bayesian BBA based on hierarchical decompo- 
sition of the FoD consists of several steps (Algorithm 1) below illustrated in Fig 3. It is worth 
noting that when the given BBAs are not Bayesian, the first step is to use the existing Probabi- 
listic Transformation (PT) to transform them to Bayesian BBAs. In order to use the proposed 
combination method in the RSs, modified rigid coarsening is mathematically denoted as @ in 
the following sections. 
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m; (+) is Bayesian? 


no 
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Product of path 
link weights 
Final Combined 
Bayesian BBA 


Fig 3. Modified rigid coarsening of FoD for fusion. 
https://doi.org/10.1371/journal.pone.0189703.g003 





Algorithm 1: Modified Rigid Coarsening Method 


Input: All original BBAs m9®(-),---,m®(-), S= 1, 2, =, S 
Output: The final combined BBA m? (-) 
1 if Compound focal elements in ©: 0;U0; Æ Ú or 8N- # ú then 

2 Probabilistic transformation: DSmP(m?(-)), DSmP(m8(-)),--+,DSmP(m9 (-)) 
3 end 

4 for i < n do 

5 for s < S do 
6 

7 

8 

9 





Calculate D,_,(6;)* Ea Dai |d,(m?(-), my, (.)) = dy(m>(-), my,(.))| 
end 
end 


for i < n do 





10 Sorting D,_;(0;) in an ascending order. 
11 end 

12 while |O©| > 2 do 

13 if n is an even number then 

14 

15 

16 

17 

18 

9 








20 Then connection weights A is calculated: PCR6 (mt (@,), m(w>) ) 

21 end 

22 foreach focal element 9;, i €e 1, ---, n do 

23 ~m°(@;) equals to the product of path link weights from root to ter- 
minal nodes. 

24 end 
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Fig 4. Accuracy comparisons between MRC and PCR6 (Only Singletons). 
https://doi.org/10.1371/journal.pone.0189703.g004 





Simulation considering accuracy and computational efficiency 


e Accuracy: 
Assuming that the FoD is © = {04, 02, 03, 04, 05, 06, 07, Os, Ao, O10 O11 O12 13» A143 O15» P16 9175 
Ois» 919, O20}, then 1000 BBAs are randomly generated to be fused with three methods: modi- 
fied rigid coarsening, rigid coarsening and also PCR6. And then distances of fusion results 
are computed using dg; between two pairs: modified rigid coarsening and PCR6; rigid coars- 
ening and PCR6. Comparisons are made in Fig 4, which show the superiority of our new 
approach proposed in this paper (The average value of the approximation of modified rigid 
coarsening is 97.5% and original rigid coarsening is 94.5%). Here, similarity represents the 
approximate degree between fusion results using hierarchical approximate method (both 
rigid and modified rigid coarsening) and PCR6. 


e Computational efficiency: 
As we mentioned before, another advantage of the hierarchical combination method is the 
computational efficiency. Here, two experiments are conducted (All experiments are imple- 
mented on a PC with I3 CPU, Integrated graphics chipsets and 4G DDR): 1) the number of sin- 
gletons is unchanged while the number of BBAs to be fused is increasing; 2) the number of 
BBAs is unchanged while the number of singletons in FoD is increasing. The results are illus- 
trated in Fig 5 and 6. From experiment 1, all these three methods (classical PCR6, rigid 
coarsening and also modified rigid coarsening) calculate quickly (less than 1.2s) even the 
number of BBAs increases from 100 to 1000. However, such situation deteriorates when the 
number of focal elements increases. In Fig 6, when the number of focal elements increases to 
500, time consumption of three combinations is: PCR6: 20.6857s; modified rigid coarsening: 
7.3320s; rigid coarsening: 5.9748s. This phenomenon also proves that it is reasonable to map 
original FoD to the coarsening FoD, with the aim of reducing the number of focal elements 
at the time of fusion. But in any case, computing efficiency of rigid coarsening or modified 
rigid coarsening is still better than PCR6. On the other hand, modified rigid coarsening 
makes a significant improvement (accuracy) at the expense of parts of the computational 
efficiency. 
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Computational time vs. Number of BBAs 
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Fig 5. Efficiency comparisons between MRC, RC and PCR6 (With the number of BBAs increasing). 


https://doi.org/10.1371/journal.pone.0189703.g005 
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Fig 6. Efficiency comparisons between MRC, RC and PCR6 (With the number of focal elements increasing). 
https://doi.org/10.1371/journal.pone.0189703.g006 


A recommender system integrating with hierarchical coarsening 
combination method 


In today’s e-commerce, online providers often recommend proper goods or services to each 
consumer based on their personal opinions or preferences [21], [22]. However, it is a tough 
task to provide appropriate recommendation which may confront several difficulties. One dif- 
ficulty is that users’ preferences are usually characterized as uncertain, imprecise or incomplete 
[23], [24], which cannot be used directly in RSs. Besides, it is easy to understand that when the 
more information about user preferences are, the more accurate prediction of RSs will be [25], 
[26]. But, the problem is that which method we adopt to integrate multi-source uncertain 
information? 

As a general framework for information fusion, DST can not only model uncertain infor- 
mation, but also provide an efficient way to combine multi-source information. These 
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mentioned features make this theory a wide range of applications [27-29], especially in RSs 
[23, 25, 30-32]. According to DST, users’ comments on products in RSs are described by using 
mass functions and rules of combination method are used frequently in order to provide 
appropriate recommendation. 

As mentioned in previous sections, both the performances of combination rules in DST or 
in DSmT suffer from computational complex which is obviously ignored in [23, 25]. Thus, in 
this paper, modified rigid coarsening method is applicable to combine the imprecise users’ 
preferences in RSs. First, we are required to introduce the relevant knowledge of RSs. Actually, 
almost all characteristics of RSs have been introduced in [23, 25, 30-32]. 

First, we give the corresponding representation of the mathematical notation in RSs based 
on DSmT. RSs usually contain two objects: {Users, Items}. A set of M users and a set containing 
N items is respectively denoted by U = {U}, U2, <- +, Uy} and I = {h, h, + + + In}. Besides, we 
assume that users can give the corresponding ratings to the items, which include L rating levels 
(© = {6,, 02, ---, Or}.). Here, L preference levels means multi-level evaluation results. For exam- 
ple, four-levels of user evaluation on the product are {Excellent, Good, Fair, Poor}. r; means a 
rating of user U; on item I, and a rating matrix R = {r;,} comprises all the ratings of users on 
items. It should be noted that r; is originally modeled as a mass function m; x: DÈ => [0; 1]. 
Additionally, let If and U; denote the set of items rated by user U; and the set of users having 
rated item Ip, respectively. 

Contextual information can often be summarized into several genres that significantly 
affect user’s rating of items. Normally, we represent contextual information by a set containing 
P genres, denoted by S = {S1, S2, - - + Sp}. And each genre Sp, with 1 < p < P contains at most Q 
groups, denoted by Sp = {8p,1> Sp.2 ` ` > Sp. ` > pah 1 < q < Q. Fora genre S, € S, a user 
U; € U can be interested in several groups and also an item I; € I can belong to one or some 
groups of this genre, which can be seen in Fig 7. 


——  Belongto 





=----> Interested in 


Fig 7. Contextual information. 





https://doi.org/10.1371/journal.pone.0189703.g007 
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Definition 4: In order to facilitate such expression, two functions x(-) and ọ(-) are defined to 
determine the groups in which user U; is interested and the groups to which item I; belongs, 


respectively: 
K, : U, +> x,(U,) CS, (12) 
pit > p) ES, (13) 


Generally, the main steps of a recommendation system is illustrated in Fig 8, which will be 
presented in details as follows: 


1. DSmT-Modeling Function 
Regarding the DS-partial probability models proposed in [23], the existing ratings r; x of 
user U; on item Ip are modeled by DSmT-modeling function M(-) in order to transform 
such hard ratings into the corresponding soft ratings represented as m; x as below: 
Definition 5: 


ot; (1 = Ti)» for A= 0; 


LALAN for A = B; 

Mi = 4 SKT for A=C; (14) 
1 — Qiks for A= O; 
0, otherwise. 
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Fig 8. General process of recommendations. 


https://doi.org/10.1371/journal.pone.0189703.g008 
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Fig 9. DSmT modeling function. 
https://doi.org/10.1371/journal.pone.0189703.g009 


with 
0, U 0, if l= 1; 
B=4 0,,U 6, if 1=L,; 
0 U 0, U Ona, otherwise. 


0, N b, if l= 1; 
C= 4 0 NO, if 1=L; 
(O N 0,0, O O41), otherwise. 


where aix € [0, 1] and o; x are a trust factor and a dispersion factor, respectively [23]. 
Referring to the partial probability model analysis in [23], we also give the corresponding 
user profiles which can be seen in Fig 9. Compared to [23], the difference is that we not 
only consider the union (black and gray rectangle), but also consider the intersection (red 
rectangle) of the hard ratings, which is also the distinction between DS theory and DSmT 
theory. 
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Lemma 1: Referring to Definition 5, we can also generate the relative refined BBA in the 
framework of DS theory: 


o,,(1 = Ti)» for A=0; 


06; Ts for A = B; 
T (15) 
1 — O33 for A= O; 
0, otherwise. 
with 
0, U 0,, if l=1; 
B= 4 0,,U 4, if l= L; 


0 U 0, U 0,,, otherwise. 


where æ; € [0, 1] and o;, are a trust factor and a dispersion factor, respectively [23]. 

After soft ratings are generated, DSmP [33] is applied to decouple non-Bayesian m; p since 
the hierarchical fusion algorithm is currently just available for Bayesian BBAs. 

Definition 6: DSmP is a new generalized pignistic transformation defined by DSmP,(0) = 0 
and for any singleton 0; € © by 


DSmP,(0,) & m(0,) + (m(0,) +e) 





mA (16) 
3 (A) 


A€2® OyCA,|A|>2 2inc2® sca s1 (B) +e: |Al 


As shown in [33], DSmP makes a remarkable improvement compared with BetP and 
CuzzP, since a more judicious redistribution of the ignorance masses to the singletons has 
been adopted by DSmP. 


2. Predicting unrated items: 
Assuming that users who are keen on the similar groups tend to have common preferences. 
In this RS, it is necessary to predict the unrated items first. Considering a group gp,q € Sp 
with gp, € p(x), every soft rating, m; x of user Uj, who is keen on group g,,,, on item I; is 
regarded as a block of common preference for group g,,q. Thus, Gm, DÊ — [0, 1] which 
represents all users’ group preferences on item J; regarding group gp, is computed as fol- 
lows 


Cie D Mix (17) 


gle 8p.q&p (Uj) &p.qEPp (Ik) } 


Supposing that item Ix has not been rated by user Uj, it usually contains three steps to gener- 
ate unprovided rating r; of user U; which are shown as below 


e Step one: Considering a concept Sp, for each group gpg E€ Kp(U;) N ppl), it is assumed that 
all users’ group preferences on item I, regarding group gp, imply common preference of 
U; on I, regarding group g,,,. Furthermore, this group preference is regarded as a piece of 
user U;’s concept preference on item J; regarding concept Sp. Therefore, concept prefer- 
ence of user U; on item J, regarding concept Sp, denoted by mass function Sm, D? = 
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[0, 1], can be computed as below 


+ Step two: If there exists at least one common group in concept Sp which item J; belongs to 
and also user U; is interested in, then U;’s concept preference on item I, regarding concept 
Sp is regarded as a piece of context preference. Therefore, this user’s contextual preference 
on item Ip denoted by mass function S,,,,: D? — [0, 1], is achieved as follows 


Swix = p Sipag (19) 


« Step three: Context preference of U; on item I, is assigned to unprovided rating m,, as 
below 


Mik 5 Smg (20) 


So far, all unprovided ratings are predicted in this RS. Subsequently, user-user similarities 
are computed depending on both provided and predicted ratings in the following steps. 


3. Computing user-user similarities: 
Here, we use the distance measure proposed in [34] to calculate distances between two 
users U; and U; with i 4 j, which is defined as below 





— a n max aO) — In min yO) 
D(U, U) = Sh Re m ma) E 


1 


where m; and m; are the soft ratings of user U; and user U; on item I; respectively. After- 


wards, the degree of similarity between U; and Uj, denoted by s;,, is calculated as follows 


s; = eP) where y € (0,00). (22) 


Obviously, if the value of s;; is high, it means the user U; and user U; are very close, and vice 
versa. Eventually, a mathematical matrix S = {s;;|U; U; € U, i + j} is employed to represent 
the similarities among all users. 


4. Selecting neighbors based on user-user similarities: 
Taking into account an active user Uj, for each unrated item I, by user Uj, a set containing 
K nearest neighborhoods, denoted by 9}, ,, is chosen by using the method proposed in [35]. 
Two simple steps of this method are shown below 


e Step one: the process of such selection depends on two criteria: 1. Those users who rated I;, 
and 2. The corresponding user-user similarities with user U; are equal or greater than the 
threshold zt. It; , denotes the selected set, which is acquired as follows: 


N = {U; € UF, € Frs > th (23) 


ij 
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+ Step two: all of members in 9è, , is descending sorted by s;j and top K members are selected 
as the neighborhood set Yt; ,.. 


5. Estimating ratings according to neighborhoods: 
Supposing that item I, has not been rated by user U;. The predicted rating of U; on item I 
is denoted as m,,. Thus, m;, is calculated according to the ratings of user U;’s nearest users. 


Mathematically, m,, is given as below 
Mix = Mix @ Mix (24) 


where m,, is the mass regarding the neighborhoods’ whole preference in the set Eq (23) on 
item Iņ. Considering user U, € Yt; „ and supposing that s;, is the similarity between user U; 
and user Uj. We use a discount rate 1 — s; j to discount the rating of user Uj on item Iy. 


Therefore, m, , is: 
M= D m;i (25) 


{j]UjENi k} 
sij Sj X m,,(A), for ACO; 
where mj, = 
Sij X m, (0) P (1 = S;;) if A=0. 


6. Generating recommendations: 
In order to generate appropriate recommendations for the candidate user Uj, predicted rat- 
ings of U; on all unprovided items are sorted, and then based on the sorted list, the appro- 
priate recommendations are generated. 


Experiments 


To evaluate the performance of modified rigid coarsening in precision of recommendation 
and computational time, original rigid coarsening method and also classical PCR6 combina- 
tion method are selected to be regarded as baselines. Besides, we use DS-MAE [23] to measure 
the precision of recommendations. 

Definition 7: DS-MAE is mathematically given as follows 


DS-MAE(O,) = 57> JO lôn(0) -MO (26) 


| | (i,k) ED;,0,<O 


where D; is the testing set identifying the user-item pairs whose true rating is 0; € ©. 

Those specific users’ interested information about genres is unknown. Thus, we define a 
rule that if a user has rated an item then this user is interested in all genres to which the item 
belongs. 


1. Experiment One: 
Movielens (http://grouplens.org/datasets/movielens) is a movie recommendation dataset 
widely used for benchmarking process. There are nearly 100,000 hard ratings on 19 differ- 
ent types of movies (Action, Comedy and so on). The domain of such rating given in 
Movielens includes 5 levels, denoted as © = {1, 2, 3, 4, 5},. At the same time, each user is 
required to evaluate at least 20 movies, so as to ensure adequate rating information. 
The relevant parameters used in RSs are setted: y = 10-* and V(i, k) {or 4 oig} = {0.9, 2/3}. 
However, Setting parameter T to be a fixed value is obviously unreasonable because the sim- 
ilarity between two users is quite different when using different combination methods. 
Hence, in this paper, the value of parameter t will not be setted in advance. Instead, it is 
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Fig 10. Overall DS-MAE between three combination methods. (Movielens). 


https://doi.org/10.1371/journal.pone.0189703.g010 





determined based on the similarity in matrix S. Specifically, the highest value of top 30% in 
S is selected for T. 

Additionally, we adopt the robust strategy of 10-fold cross validation to conduct experi- 
ments, which is widely applied in experimental verification. Specific steps are as follows: 
original ratings in Movielens are first randomly divided into 10-folds and the experiments 
are thus carried out 10 times: in each sub-experiment, nine tenths of the ratings are chosen 
as training data and the remaining ratings are regarded as testing data. It’s worth noting 
that all results illustrated in the following experiments are the average values of 10 times. 
Fig 10 demonstrates the values of overall DS-MAE varying with changing neighborhood 
size K. And the smaller values of DS-MAE indicate the better ones. As can be seen in Fig 10, 
with K < 70 performances of the three methods increase sharply as well as being the same 
as each other. With K > 70, performances of both methods become stable. Especially, per- 
formance of modified rigid coarsening method is very close to classical PCR6 rules. How- 
ever, original rigid coarsening is slightly worse than the other two algorithms. 

Fig 11 depicts the computational time varying with changing neighborhood size K. In this 
figure, the time taken by hierarchical coarsening combination methods (both rigid coarsen- 
ing and modified rigid coarsening method) is quite faster compared to classical PCR6. 
Besides, modified rigid coarsening is relatively slower than original rigid coarsening. All 
these results illustrate that modified rigid coarsening method sacrifices some of the compu- 
tational efficiency, in exchange for upgrading the accuracy of approximation. 


2. Experiment Two: 
Flixster (http://www.cs.ubc.ca/jamalim/datasets/) is a classical recommendation dataset 
which nearly contains 535013 hard ratings on 19 different types of movies (Drama, Comedy 
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Fig 11. Overall computational time between three combination methods. (Movielens). 
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and so on). The domain of such rating given in Flixster includes 10 levels, denoted as © = 
{0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0},. At the same time, each user is required to evalu- 
ate at least 15 movies, so as to ensure adequate rating information. The relevant parameters 
used in RSs are setted: y = 10°* and V(i, k){a,4, og} = {0.9, 2/3}. However, Setting parameter 
T to be a fixed value is obviously unreasonable because the similarity between two users is 
quite different when using different combination methods. Hence, in this paper, the value 
of parameter T will not be setted in advance. Instead, it is determined based on the similarity 
in matrix S. Specifically, the highest value of top 50% in S is selected for T. 

Fig 12 demonstrates the values of overall DS-MAE varying with changing neighborhood 
size K. And the smaller values of DS-MAE indicate the better ones. As can be seen in Fig 12, 
we can get a similar result to the previous data set(Movielens). Especially, performance of 
modified rigid coarsening method is in the middle of the comparison methods. However, 
original rigid coarsening is worse than the other two algorithms. Fig 13 depicts the compu- 
tational time varying with changing neighborhood size K. From this figure, we can also get 
the same conclusion that the time taken by hierarchical coarsening combination methods 
(both rigid coarsening and modified rigid coarsening method) is quite faster compared to 
classical PCRO. 
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Conclusion 


In this paper, we propose a new combination method, called modified rigid coarsening 
method. This new method can map the original refined FoD to the new coarsening FoD in the 
process of combination. Compared to traditional fusion method PCR6 in DSmT, this 
approach can not only reduce computational complexity, but also ensure high approximation 
accuracy. Besides, in order to verify the practicality of our approach, we apply this approach to 
fuse soft ratings in RSs. To be specific, user preferences are first transformed by DSmT-partial 
probability model to accurately represent uncertain information. Then, information about 
user preferences from different sources can be easily combined. In the future work, more help- 
ful information will be mined to discern focal element in FoD so as to improve the accuracy of 
approximation and more data sets will be applied. 
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Fig 13. Overall computational time between three combination methods. (Flixster). 
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